Skip to main content

Fix Broken Links Web Crawls

These crawls are part of an effort to archive pages as they are created and archive the pages that they refer to. That way, as the pages that are referenced are changed or taken from the web, a link to the version that was live when the page was written will be preserved.

Then the Internet Archive hopes that references to these archived pages will be put in place of a link that would be otherwise be broken, or a companion link to allow people to see what was originally intended by a page's authors.

The goal is to fix all broken links on the web. Crawls of supported "No More 404" sites.

65,559
RESULTS
rss


Media Type
4
collections
65,555
web
Topics & Subjects
65,555
crawldata
34,859
no404
17,806
wordpress
16,302
wikipedia
751
search
1
GDELT
More right-solid
Collection
65,559
Fix Broken Links Web Crawls
65,559
Web Crawls
30,696
GDELT
17,806
Wordpress Blogs and the Pages They Link To
15,662
Wikipedia Near Real Time (from IRC)
2,265
Internet Archive Web Crawls
More right-solid
Creator
65,555
internet archive
SHOW DETAILS
up-solid down-solid
eye
Title
Date Reviewed
Creator
GDELT
web
eye 55
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jul 1 01:03:44 PDT 2016 to Thu Jun 30 20:37:54 PDT 2016.
Topic: crawldata
GDELT
web
eye 68
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Wed Jul 6 01:50:46 PDT 2016 to Tue Jul 5 20:25:10 PDT 2016.
Topic: crawldata
GDELT
web
eye 38
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Tue Jul 5 13:27:48 PDT 2016 to Tue Jul 5 06:34:15 PDT 2016.
Topic: crawldata
Internet Archive crawldata from feed-driven Wikipedia Outlinks Crawl, captured by crawl345.us.archive.org:no404 from Sat Nov 1 13:44:49 PDT 2014 to Sat Nov 1 07:32:20 PDT 2014.
Topics: no404, wikipedia, crawldata
GDELT
web
eye 114
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Mon Sep 19 01:29:43 PDT 2016 to Sun Sep 18 22:43:31 PDT 2016.
Topic: crawldata
GDELT
web
eye 125
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Mar 4 16:30:15 PST 2016 to Fri Mar 4 08:53:58 PST 2016.
Topic: crawldata
GDELT
web
eye 228
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Thu Aug 11 03:24:08 PDT 2016 to Thu Aug 11 00:06:34 PDT 2016.
Topic: crawldata
GDELT
web
eye 91
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Tue Oct 11 22:07:47 PDT 2016 to Tue Oct 11 15:58:23 PDT 2016.
Topic: crawldata
GDELT
web
eye 290
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri May 13 04:32:21 PDT 2016 to Thu May 12 21:53:54 PDT 2016.
Topic: crawldata
GDELT
web
eye 110
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Wed Jun 22 03:46:55 PDT 2016 to Tue Jun 21 22:11:13 PDT 2016.
Topic: crawldata
GDELT
web
eye 205
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Thu Sep 22 18:06:07 PDT 2016 to Thu Sep 22 13:02:42 PDT 2016.
Topic: crawldata
GDELT
web
eye 151
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Tue Aug 30 21:32:24 PDT 2016 to Tue Aug 30 16:14:28 PDT 2016.
Topic: crawldata
GDELT
web
eye 129
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Fri Nov 11 06:03:32 PST 2016 to Fri Nov 11 00:36:44 PST 2016.
Topic: crawldata
GDELT
web
eye 96
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jul 1 00:20:23 PDT 2016 to Thu Jun 30 20:04:47 PDT 2016.
Topic: crawldata
GDELT
web
eye 154
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sat Sep 3 01:08:51 PDT 2016 to Fri Sep 2 20:07:01 PDT 2016.
Topic: crawldata
GDELT
web
eye 147
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jul 1 04:11:54 PDT 2016 to Thu Jun 30 23:21:42 PDT 2016.
Topic: crawldata
GDELT
web
eye 309
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Sun Mar 6 05:28:41 PST 2016 to Sat Mar 5 23:25:04 PST 2016.
Topic: crawldata
GDELT
web
eye 98
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Mon Aug 22 06:28:45 PDT 2016 to Mon Aug 22 00:49:05 PDT 2016.
Topic: crawldata
GDELT
web
eye 135
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl409.us.archive.org:gdelt from Tue Sep 27 02:28:56 PDT 2016 to Tue Sep 27 00:09:11 PDT 2016.
Topic: crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl897.us.archive.org:no404 from Sun Jul 23 06:19:31 PDT 2017 to Sun Jul 23 07:55:25 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl895.us.archive.org:no404 from Sun Jul 23 11:48:55 PDT 2017 to Sun Jul 23 12:45:40 PDT 2017.
Topics: no404, wordpress, crawldata
GDELT
web
eye 167
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Mon May 30 22:18:20 PDT 2016 to Mon May 30 16:42:34 PDT 2016.
Topic: crawldata
GDELT
web
eye 99
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Jul 15 18:51:25 PDT 2016 to Fri Jul 15 13:20:32 PDT 2016.
Topic: crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Wed Nov 27 07:55:24 PST 2013 to Wed Nov 27 01:46:32 PST 2013.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Tue Apr 29 09:48:26 PDT 2014 to Tue Apr 29 12:50:49 PDT 2014.
Topics: no404, wordpress, crawldata
GDELT
web
eye 202
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sun Jun 26 22:03:41 PDT 2016 to Sun Jun 26 18:26:25 PDT 2016.
Topic: crawldata
GDELT
web
eye 237
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Mon May 2 03:42:55 PDT 2016 to Sun May 1 22:10:18 PDT 2016.
Topic: crawldata
GDELT
web
eye 95
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Wed May 4 15:37:08 PDT 2016 to Wed May 4 08:47:59 PDT 2016.
Topic: crawldata
GDELT
web
eye 229
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Mon Jun 13 07:10:14 PDT 2016 to Mon Jun 13 02:06:11 PDT 2016.
Topic: crawldata
GDELT
web
eye 1,294
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Thu Mar 17 10:51:11 PDT 2016 to Thu Mar 17 05:11:44 PDT 2016.
Topic: crawldata
GDELT
web
eye 68
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri Mar 18 17:08:17 PDT 2016 to Fri Mar 18 10:12:05 PDT 2016.
Topic: crawldata
GDELT
web
eye 1,387
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sat Apr 23 10:24:58 PDT 2016 to Sat Apr 23 03:51:51 PDT 2016.
Topic: crawldata
GDELT
web
eye 518
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Mon May 2 18:03:03 PDT 2016 to Mon May 2 12:01:14 PDT 2016.
Topic: crawldata
GDELT
web
eye 408
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Fri May 6 08:53:56 PDT 2016 to Fri May 6 02:16:40 PDT 2016.
Topic: crawldata
GDELT
web
eye 395
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Sun May 8 11:11:16 PDT 2016 to Sun May 8 04:35:58 PDT 2016.
Topic: crawldata
GDELT
web
eye 440
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Wed May 11 19:30:12 PDT 2016 to Wed May 11 14:11:34 PDT 2016.
Topic: crawldata
GDELT
web
eye 250
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Mon May 9 04:54:37 PDT 2016 to Sun May 8 23:04:11 PDT 2016.
Topic: crawldata
GDELT
web
eye 283
favorite 0
comment 0
Internet Archive crawldata from feed-driven GDELT Crawl, captured by crawl816.us.archive.org:gdelt from Tue May 31 12:05:37 PDT 2016 to Tue May 31 05:51:27 PDT 2016.
Topic: crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Mon Mar 6 00:43:08 PST 2017 to Sun Mar 5 18:40:47 PST 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Mon Mar 6 03:31:32 PST 2017 to Sun Mar 5 22:24:17 PST 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Sat Mar 11 18:15:30 PST 2017 to Sat Mar 11 11:59:12 PST 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Sat Mar 11 20:39:51 PST 2017 to Sat Mar 11 14:01:30 PST 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Sat Mar 11 20:46:03 PST 2017 to Sat Mar 11 14:26:17 PST 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Thu Mar 9 23:52:30 PST 2017 to Thu Mar 9 17:32:16 PST 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Fri Mar 10 10:32:11 PST 2017 to Fri Mar 10 07:37:03 PST 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Fri Mar 10 01:16:50 PST 2017 to Thu Mar 9 20:32:19 PST 2017.
Topics: no404, wordpress, crawldata
Wordpress Blogs and the Pages They Link To
web
eye 29
favorite 0
comment 0
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from to Sat Mar 11 20:35:05 PST 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Fri Mar 10 04:32:19 PST 2017 to Thu Mar 9 23:36:28 PST 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Sun Mar 12 16:25:41 PDT 2017 to Sun Mar 12 10:27:39 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Sun Mar 12 23:01:53 PDT 2017 to Sun Mar 12 17:20:07 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Mon Mar 13 06:37:02 PDT 2017 to Mon Mar 13 01:03:33 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Mon Mar 13 05:30:23 PDT 2017 to Sun Mar 12 23:44:17 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Mon Mar 13 07:30:38 PDT 2017 to Mon Mar 13 02:29:07 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Sun Mar 26 07:52:07 PDT 2017 to Sun Mar 26 02:03:48 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Sun Mar 26 10:38:25 PDT 2017 to Sun Mar 26 05:31:45 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Tue Mar 28 12:45:13 PDT 2017 to Tue Mar 28 07:41:30 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Tue Mar 28 11:36:13 PDT 2017 to Tue Mar 28 06:41:24 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Tue Mar 28 08:28:48 PDT 2017 to Tue Mar 28 02:46:12 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Mon Mar 27 12:04:41 PDT 2017 to Mon Mar 27 06:43:41 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Thu Mar 30 05:17:01 PDT 2017 to Wed Mar 29 23:56:35 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Thu Mar 23 05:49:07 PDT 2017 to Thu Mar 23 00:33:08 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Wed Mar 29 23:06:51 PDT 2017 to Wed Mar 29 17:36:36 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Thu Mar 30 03:16:17 PDT 2017 to Wed Mar 29 22:01:41 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Thu Mar 23 03:04:50 PDT 2017 to Wed Mar 22 21:21:46 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Fri Mar 31 22:51:56 PDT 2017 to Fri Mar 31 18:30:21 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Fri Mar 24 19:00:42 PDT 2017 to Fri Mar 24 13:20:26 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Thu Mar 30 20:30:05 PDT 2017 to Thu Mar 30 15:46:46 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Sat Mar 25 02:05:42 PDT 2017 to Fri Mar 24 21:21:43 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Thu Mar 30 08:16:02 PDT 2017 to Thu Mar 30 03:42:25 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Thu Mar 30 11:37:57 PDT 2017 to Thu Mar 30 05:41:36 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Thu Apr 6 13:23:32 PDT 2017 to Thu Apr 6 09:27:01 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Wed Apr 5 06:15:43 PDT 2017 to Wed Apr 5 01:36:57 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Wed Apr 5 12:40:26 PDT 2017 to Wed Apr 5 07:48:05 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl818.us.archive.org:no404 from Fri Apr 7 15:39:39 PDT 2017 to Fri Apr 7 10:22:06 PDT 2017.
Topics: no404, wordpress, crawldata
Internet Archive crawldata from feed-driven WordPress Crawl, captured by crawl344.us.archive.org:no404 from Wed Sep 14 18:31:00 PDT 2016 to Wed Sep 14 12:38:06 PDT 2016.
Topics: no404, wordpress, crawldata